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Abstract 

In the so called lightbulb process, on days r = 1,... ,n, out of n lightbulbs, all 
initially off, exactly r bulbs selected uniformly and independent of the past have their 
status changed from off to on, or vice versa. With Wn the number of bulbs on at the 
terminal time n and C„ a suitable clubbed binomial distribution, 

dTY{Wn,Cn) < 2.73U,/7ie''^''+^^/^ for all n > 1. 

The result is shown using Stein's method. 

1 Introduction 

The lightbulb process introduced by ^ was motivated by a pharmaceutical study of the 
effect of dermal patches designed to activate targeted receptors. An active receptor will 
become inactive, and an inactive one active, if it receives a dose of medicine released 
from the dermal patch. On each of n successive days r = 1 , . . . , n of the study, exactly 
r randomly selected receptors will each receive one dose of medicine from the patch, 
thus changing, or toggling, their status between the active and inactive states. We 
adopt the more colorful language of [3] , where receptors are represented by lightbulbs 
that are being toggled between their on and off states. 

Some fundamental properties of Wn, the number of light bulbs on at the end of day 
n, were derived in [3]. For instance, Proposition 2 of [3] shows that when n(n + 1)/2 = 
0mod2, or, equivalently, when nmod4 S {0,3}, the support of Wn is a subset of even 
integers up to n, and that otherwise the support of Wn is a set of odd integers up to n. 
Further, in [3], the mean and variance of Wn were computed, and based on numerical 
computations, an approximation of the distribution of Wn by the 'clubbed' binomial 
distribution was suggested. 

To describe the clubbed binomial, let Zn be a binomial Bin(n — 1,1/2) random 
variable, and for i G Z let vr* = P{Zn = i), that is 

forz = 0,l,...,n-l, 
otherwise. 
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Let Li^n and Lo,n denote the set of all odd and even numbers in {0, 1, . . . , n}, respec- 
tively. Define, for m = 0, 1, 



0, i ^ LfYi n- 



Summing binomial coefficients using 'Pascal's triangle' yields 



-'m,n- 



We say that the random variable Cm,n has the clubbed binomial distribution ii P[Cm,n — 
i) = TT™ for z e L 

m,n- In words, the clubbed binomial distribution is formed by com- 
bining two adjacent cells of the binomial. 

It was observed in [3j that the clubbed binomial distribution appeared to approxi- 
mate the lightbulb distribution Wn exponentially well. Here we make that observation 
rigorous by supplying an exponentially decaying bound in total variation. First, recall 
that if X and Y are two random variables with distributions supported on Z, then the 
total variation distance between the (laws of) X and Y, denoted dTv{X,Y), is given 

by 

dTY{X,Y) = sup \P{X e A)- P{Y eA)\. (2) 

Acz 

Theorem 1.1 Let Wn be the total number of bulbs on at the terminal time in the 
lightbulb process of size n and let Cn = Cm,n where m = for nmod4 G {0,3} and 
m = 1 /or nmod4 E {1, 2}. Then 

In particular, the approximation error is less than 1% for n > 21 and less than 0.1% 
for n > 28. 

A Berry-Esseen bound in the Kolmogorov metric of order l/\/n for the distance 
between the standardized value of Wn and the unit normal was derived in |2j. The 
lighbulb chain was also studied in [3], and served there as a basis for the exploration 
of the more general class of Markov chains of multinomial type. One feature of such 
chains is their easily obtainable spectral decomposition, which informed the analysis 
in [2]. In contrast, here we demonstrate the exponential bound in total variation using 
only simple properties of the lightbulb process. 

After formalizing the framework for the lightbulb process in the next section, we 
prove Theorem 1 1.1 1 bv Stein's method. In particular, we develop a Stein operator A for 
the clubbed binomial distribution and obtain bounds on the solution / of the associated 
Stein equation. The exponentially small distance between Wn and the clubbed binomial 
Cn can then be seen to be a consequence of the vanishing of the expectation of Af 
except on a set of exponentially small probability. 



2 The lightbulb process 

We now more formally describe the lightbulb process. With n E N fixed we will let 
X = {Xrk : r = 0,1, ... ,n,k = 1, . . . ,n} denote a collection of Bernoulli variables. For 
r > 1 these 'switch' or 'toggle' variables have the interpretation that 

^ _ f 1 if the status of bulb k is changed at stage r, 
I otherwise. 
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We take the initial state of the bulbs to be given deterministically by setting the switch 
variables {Xq/., k = 1, . . . ,n} equal to zero, that is, all bulbs begin in the off position. 
At stage r for r = l,...,n,rof the n bulbs are chosen uniformly to have their status 
changed, with different stages mutually independent. Hence, with ei, . . . , G {0, 1}, 
the joint distribution of Xri, ■ ■ ■ , Xm is given by 

p(Y - f> Y (r)""^ if^i"^ he„ = r, 

-r(,Arl — ei, • • • , A^n — e„j — < ' 

[ U otherwise, 
with the collections {X^i, . . . , X^n} independent for r = 1, . . . , n. 

Clearly, at each stage r the variables (X^i, • • • ^Xm) are exchangeable. 

For r,i = 1, . . . ,n, the quantity (X]I=i ^si) mod 2 is the indicator that bulb i is 
on at time r of the lightbulb process, so letting 

(n \ n 

Y,^ri \ mod 2 and M/„ = ^/i, 
r=0 / i=l 

the variable li is the indicator that bulb i is on at the terminal time, and Wn is the 
number of bulbs on at the terminal time. 

The lightbulb process is a special case of a class of multivariate chains studied in 
[1], where randomly chosen subsets of n individual particles evolve according to the 
same marginal Markov chain. As shown in [4J, such chains admit explicit full spectral 
decompositions, and in particular, the transition matrices for each stage of the lightbulb 
process can be simultaneously diagonalized by a Hadamard matrix. These properties 
were applied in [3] for the calculation of the moments needed to compute the mean 
and variance of Wn and to develop recursions for the exact distribution, and in |2j for 
a Berry-Esseen bound of the standardized Wn to the normal. 



3 Stein Operator 

In order to apply Stein's method, we first develop a Stein equation for the clubbed 
binomial distribution Cm,n and then present bounds on its solution. With vr!^ given 
by dU), let 7r™(A) = E^eA^™- Set = (n - x){n - 1 - x) and = x{x - 1) for 
X G {0, . . . , n}. One may easily directly verify the balance equation 

a^_2vr;^_2 = /3x7r™ for x £ Lm,n, (3) 

which gives the generator of the distribution of Cm,,n as 

Af{x) = a^f{x + 2) - for x £ Lm,n- (4) 

For A C Lm,n-, we consider the Stein equation 

AfA{x) = Ia{x) - Ti'^iA), x G (5) 

For a function g with domain A let \\g\\ denote sup^g^ 15(2^) I- 

Lemma 3.1 For m G {0, 1} and A = {r} with r G Lm,n, the unique solution /™(x) of 
^ on Lm,n satisfying the boundary condition f^{m) = is given, for m < x < n,x £ 

Lm,nj by 



7r'"([0,a:-2]nL^,nX 

-5 — m for r + 2 < X < n. 



m ) 13 m < X < r + 2 
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Furthermore, for all A C Lm^n, /X(^) ~ J2reA fr^i^) ^•^ ^ solution of ^ and satisfies 

„,^„ 2.7314 
/X forn>l. 

Lemma l3. II is proved in Section HI 

Applying Lemma |3.H we now prove our main result. 

Proof of Theorem Fix m G {0, 1} and A C Lm,n, and let / := /™ be the 
solution to (0). Dropping subscripts, let W = Y^^=i^ii where li is the indicator that 
bulb i is on at the terminal time. For i,j £ now with slight abuse of 

notation, let Wi = W — li, and for i / j set Wij =W — U — Ij. Then 

E{n -W){n-1- W)f{W + 2) 

n 

= EY,{'^-h){n-l-W)f{W, + 2) 
1=1 



Sj](l-/,)(l-/,)/(H^,, + 2), 



and similarly. 



EWiW - l)fiW) = eY, hWifiWi + 1) = eY, hIjf{Wi + 1) = E ^ hljfiW.j + 2). 

i=l i^j i^j 

By Proposition 2 of [3j, P{W £ Lm,n) = and hence ^ holds upon replacing x 
by W . Taking expectation and using the expression for the generator in ([4]), we obtain 

Pi^W £A)- T,^{A) = EAf{W) = ((1 - h){l - Ij) - hlj) fiWij + 2). (7) 

Recalling that Xrk is the value of the switch variable at time r for bulb k, let Aij be 
the event that the switch variables of the distinct bulbs i and j differ in at least one 
stage, that is, let 



(8) 



r=l 



Now using ([7|) we obtain 



\P{W G ^) -7r"^(yl)| 



((1 - /,)(! - W - hij) fiW.j + 2) 



< 



^E{il- m - Ij) - Id,) f{W,, + 2)1a, 



+ 



Y, E{{i- mi - ij) - hi,) fiWij + 2)i^c^ 



(9) 



Note that Ii,Ij G {0, 1} implies 

(l-/,)(l-^j)lW=0 = /^/,l/,^7,, 
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and hence for the first term in ([9]) we obtain 



^ ((1 - - Ij) - hi,) f{W,j + 2)U 



((1 - h){l - I,) - IJj) f{W,, + 2)U,,/,=7,. 



(10) 



For a given pair on the event A^j let t be any index for which X^j ^ Xfj, and 
let X*-' be the collection of switch variables given by 



( Xrk r ^t, 

Xtk r = t,k ^ {i,j}, 
Xti r = t,k= j, 



r = t,k 



I. 



In other words, in stage t, the unequal switch variables Xu and Xtj are interchanged, 



and all other variables are left unchanged. Let be the status of bulb k at the terminal 



time when applying switch variables X*-', and similarly set W-j = J2ki^{i j} ■ Note 
that as the status of both bulbs i and j are toggled upon interchanging their stage t 
switch variables, and all other variables are unaffected, we obtain 



l-L 



1 - Ij and W-^ 



W, 



With Xli 



In particular, /j = Ij if and only if = /j-', and, with A^^- as in 
replacing X^k-, we have additionally that A^^ - = Aij. Further, by exchangeability we 
have £(X) = £(X^J). Therefore, 

S(l-I,)(l-/,)/(H^i,+2)lA.„/,=7, 

= E{l-lf){l-lJ)f{wij+i)l^,j.,^^., 

= Eia,f{W,,+2)lA,,M=i„ 
showing, by (jlOh . that the first term in ([9]) is zero. Therefore, 
\PiW £ A)-tt'^{A)\ 



< 



i+3 



< 



As A\- is the event that the switch variables of i and j are equal in every stage, recalling 
that these variables are independent over stages we obtain 



P{A 



I]) 



■j^ r(r — 1) + (n — r)(n 



r=l 



n > 



r=l 



n(n — 1) 

2(nr — r^l 



n[n 
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< 



-(n+l)/3 



Hence, by Lemma |3. 11 



2 7*^1 4 

P{W G ^) - 7r'"(A)| < -n(n - l)e-("+i)/=^ = 2.7314V^e-("+^)/3^ 



^/n{n - 1) 

Taking supremum over A and applying definition ([2|) completes the proof, 



□ 
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4 Bounds on the Stein equation 

In this section we present the proof of Lemma 13.11 

Proof: Let m G {0, 1} be fixed. First, the equaUties /(m) = and 

J (x + 2) = for m < X < n — 2, X e Lm n 

ax 

specify /(x) on Lm,n uniquely, hence the solution to ([5]) satisfying the given boundary 
condition is unique. 

Next, with r S Lm,n) we verify that /^(x) given by ([6]) solves ([5]) with A = {r}; 
that f^{m) = is given. For m < x < r, x £ Lm,n, applying the balance equation ([3]) 
to obtain the second equality, we have 

a,/r(x + 2)-/3,/r(x) 

/ 7r"^([0,x]nL^,nX ^ _^ f 7r"^([0,x - 2] n 



7r"'([0,x]n _ f 7r"([0, X - 2] n Lm,n)K 



If X = r then 



a,/r(x + 2) - /3./r(x) 

^ ^"^([r + 2,n]nL^,^X >^ ^ ^^-^'"([0, r - 2] n L^,^)7r; 

— '^f o _m A 



/7r'"([r + 2,n]nL^,„X\ ^ ,^-^™([0, r - 2] n V„)< 



7r'"([r + 2, n] n + 7r^^{[0, r - 2] n L^,^ = 1 " 



If X > r then 



a,/r(x + 2)-/3,./,'"(x) 

/7r'"([x + 2,n]nL™,„X\ ,^^-([x, n] n 

= «^ o — ] -Px' 



V"([x + 2,n]n ,^^™([x,n]nL^,„)^,^ 



-IT. 



m 

X 

m 



Hence /^(x) solves ([5]). 

Next, to consider the solution of ([5]) more generally for A C -Lm,n aud x G -Z>m,n) 
letting 

f^m,x = [0, X — 2] n Lm^n aud C^m,a; ~ -^m,r), \ Um,xj 

we may write ([6]) more compactly as 

r (x) = ^ (vr™(C/;^,Jvr™({r} n C/^,.) - 7r-(C/^,,.)vr"^({r-} n C/;^,J) . 

Px~x 
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By linearity, the solution of ([5]) for A C Lm,n is given by f^{m) = 0, and for x > 

m,x £ Lm,n, by 

/a (^) = ^ {TT^iU^^^TT^iA n Um,.) - 7r-(C/„,,.)vr™(A n f/;^, J) 
(cf [1], p. 7), and so, for all x G L^.n, 

^ vr-([/„,,.)vr-([/;^,,) < < -i-vr-([/;^,j7r-(C/„,.), 



or that 

\mx)\ < ^vr-(C/^,.)7r-(C/;^,J. (11) 

Since /™(m) = and the upper bound of Lemma l3.ll reduces to oo if < n < 1, 
we only need to bound f^{x) for n > 2 and x > 2. Direct computation using (|lip 
gives 1/0(2)1 < 1/4 for n = 2, |/0(2)| < 1/8 and |/i(3)| < 1/8 for n = 3, |/0(2)| = 
|/°(4)| < 7/96 and \f\i3)\ < 1/12 for n = 4. Therefore, it remains to prove Lemma [3. II 
for n > 5. 

Noting that for x > | + 1 we have Px > + l) §, and for x < ^ + 1 that 
Gix-2 = {n — X + 2){n — X + 1) > (f + l) f , using ([3]), we obtain from (fTT|) that 

{ 7r"'([/m,3:)7r'"(C/^^j,) j (t/m.a: )7r"' (f7^^ j. ) :f ^ > _|_ 1 

".-2^™ 2 - (f+i)5 2 n X < 2 + i. 

Clearly, for i > x, 

tt"^ _ (") _ J 1 if i = X 

^ ^ (!) ^ 1 ifz>x + 2. 

Hence, we can write, for i > x + 2, 



Note that as (n — x)/(x + 1) < 1 for x > n/2, the terms in the product (|13p are 
decreasing. In particular, 

— ^ < 1 for i > X, and < 1 provided x > —. (14) 

o<y<l^\ 

For n even let Xg = n/2, and for n odd let x^ = (n — l)/2 when m = 0, and 
X5 = (n + 1)/2 when m = 1. Then, except for the case where m = and x = (n + 1)/2, 
which we deal with separately, we have 

7r"'{Um,x)T^"'{U^^x) = '^"'iUm,2xs-x+2)'^"'{U^^2xs-x+2)^ 

and we may therefore assume x > x^ + 1, and so x > n/2 + 1. 
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Since for y > -v/n/2, recalling x > n/2 + 1, we have 
n — X — y n 



< 



\/n n _ y/n_ _ i 
2 _ 2 2 



§ + 2+ 2 



applying (|T3|) and (|M|) we conclude that 



< 1 



for i > X + [^J + 1. 



Hence, applying (|M|) again, here to obtain the second inequality, we have 

1 



(§ + !)§ vr- 



< 



< 



1 



(§ + !)§ 
1 



\x<j<x+[^J,ieL- 



\ 



(f +1) 



n 



i=o 



2+2 + ^, 



1 



< 



(§ + !)§ 
2.7314 



jn 



V 



1 



1 



\/n+3 



t+2+ 2 



for n > 1. 



\/n(n - 1) 

This final inequality is obtained by determining the maximum of the function 

/ \ 



1 



n 



+ 1 + 



1 



v 



1- 1 



Vn+3 
t+2+^ 



\/n{n — 1) 



by noting gi{n) < 1 + + ^J^^-j^^^^^^ < 2.5 for n > 64 and ma.xi<n<63 gi{n) = 
gi{9) = 2.7313131.... 

Lastly we handle the situation where n is odd, m = and x = (n+l)/2 =: xq € Lo,n, 
in which case n = 3 mod 4. In place of (jlSp . we have, for y > \/n/2, 



(15) 



(16) 



xo + 1 + y ^ (li+i) + 1 + n + 3 + ^' 



n - xo - y ^ 



2 / ' ^ ' 2 

Since (|14p is valid for all x > n/2, in view of ()13p we obtain the bound 



< 1 



vr'" 



4 + 2^71 
n + 3 + ^/n 



for i > xo + L^J + 1- 
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Using ()14p again for the first inequality we have 

1 vr-([/„,,j7r-([/;^_,J 



vr'" 




TT'- v-^ IT- 



y. —+ y 



V n , n + 3 + ^ 
§ + l)nV~ 4 + 2^^ 

1.638496535 , , 
\/n(n - 1) 

where the last inequality is from bounding the function 

( \ 1 ( ^ ^ ^ , ^^ + 3 + \/n\ ^ 

^ ^ ^ (f + 1) n V 4 4 + 2^^ ; ^ ' 

with g2{n) < ^ + + "^^^ < 1-6 for n > 400 and maxi<„<399 5r2("-) = 52(23) = 
1.638496535. 

The result now follows from combining the estimates (I16p . ()17|) and ()12p . □ 
We remark that a direct argument using Stirling's formula for the case x = [ra/2j 
shows that the best order that can be achieved for the estimate of is 0{n^^^^). 
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